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Abstract 

We summarize a book under publication with the above title writ- 
ten by the three present authors, on the theory of Zipf's law, and 
more generally of power laws, driven by the mechanism of proportional 
growth. The preprint is available upon request from the authors. 

For clarity, consistence of language and conciseness, we discuss the 
origin and conditions of the validity of Zipf's law using the terminol- 
ogy of firms' asset values. We use firms at the entities whose size 
distributions are to be explained. It should be noted, however, that 
most of the relations discussed in this book, especially the intimate 
connection between Zipf's and Gilbrat's laws, underlie Zipf's law in 
diverse scientific areas. The same models and variations thereof can 
be straightforwardly applied to any of the other domains of application. 
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Executive summary 

Zipf's law is one of the few quantitative reproducible regularities found 
in economics. It states that, for most countries, the size distributions of city 
sizes and of firms (with additional examples found in many other scientific 
fields) are power laws with a specific exponent: the number of cities and of 
firms with size greater than S is inversely proportional to S. 

Most explanations start with Gibrat's law of proportional growth but 
need to incorporate additional constraints and ingredients introducing devi- 
ations from it. 

Here, we present a general theoretical derivation of Zipf's law, provid- 
ing a synthesis and extension of previous approaches. First, we show that 
combining Gibrat's law at all firm levels with random processes of firms' 
births and deaths yield Zipf's law under a "balance" condition between firm 
growth and their death rate. 

We find that Gibrat's law of proportionate growth does not need to be 
strictly satisfied. As long as the volatility of firms' sizes increases asymp- 
totically proportionally to the size of the firm and that the instantaneous 
growth rate increases not faster than the volatility, the distribution of firm 
sizes follows Zipf's law. This suggests that the occurrence of very large 
firms in the distribution of firm sizes described by Zipf's law is more a con- 
sequence of random growth than systematic returns: in particular for large 
firms, volatility must dominate over the instantaneous growth rate. 

We develop the theoretical framework to take into account 

1. time- varying firm creation, 

2. firms' exit resulting from both a lack of sufficient capital and sudden 
external shocks, and 

3. the coupling between firms' birth rate and the growth of the value of 
the population of firms. 

We predict deviations from Zipf's law under a variety of circumstances, 
for instance when the balance between the birth rate, the non-stochastic 
growth rate and the death rate is not fulfilled, providing a framework for 
identifying the possible origin(s) of the many reports of deviations from the 
pure Zipf's law. The tail index that characterizes the hyperbolic decay of 
the distribution is found to depend on several characteristics of the economic 
environment. Amongst others, the average growth rate of firms' asset value, 
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the rate of firms' birth and the hazard rate of a firm's sudden death have a 
direct impact on the value of the tail index. 

Reciprocally, deviations from Zipf's law in a given economy provides a 
diagnostic, suggesting possible policy corrections. The results obtained here 
are general and provide an underpinning for understanding and quantifying 
Zipf's law and the power law distribution of sizes found in many fields. 

A general result unraveled by our study is that Zipf's law is obtained if 
and only if a balanced condition is fulfilled: the sum of all the mechanisms 
responsible for the growth and/or decline of firms must vanish on average. 
Any departure from this requirement yields a departure of the tail index 
from its canonical value m = 1. This result can allow one to understand 
why different tail indexes are reported in the literature for different coun- 
tries around the world. However, the reasons that underpin the validity of 
the balance condition are not yet clear. No economic law can justify why all 
these mechanisms should almost exactly compensate one another. In the ab- 
sence of such economic argument, one has to resort to Gabaix's explanation 
based upon the idea that, in order to make stationary the distribution of 
firm's sizes, one has to first remove the impact of the overall economy on the 
growth of each individual firm. Therefore, since the overall economy grows 
at the same rate as each individual firm, on average, the balance condition 
is satisfied in the referential of the growing economy. 

1 Motivations and organization of the book 

One of the broadly accepted universal laws of complex systems, partic- 
ularly relevant in social sciences and economics, is that proposed by Zipf 
(1949). Zipf's law usually refers to the fact that the survival probability 
P(s) = Pr{S* > s} that the value S of some stochastic variable, usually a size 
or frequency, is greater than s, decays with the growth of s as P(s) ~ s _1 . 
This in turn means that the probability density functions p(s) exhibits the 
power law dependence 

p(s) ~ l/s 1+m with m = 1 . (1) 

Perhaps the distribution most studied from the perspective of Zipf's law is 
that of firm sizes, where size is proxied by sales, income, number of employ- 
ees, or total assets. Many studies have confirmed the validity of Zipf's law 
for firm sizes existing at current time t and estimated with these different 
measures (Simon and Bonini 1958, Ijri and Simon 1977, Sutton 1997, Axtell 
2001, Okuyama et al. 1999, Gaffeo et al. 2003, Aoyama et al. 2004, Fujiwara 
et al. 2004, Fujiwara et al. 2004). Initially formulated as a rank- frequency 
relationship quantifying the relative commonness of words in natural lan- 
guages (Zipf 1949), Zipf's law accounts remarkably well for the distribution 
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of city sizes (Gabaix 1999) as well as firm sizes all over the world, as just men- 
tioned. Recently, Zipf 's law has also been found in Web access statistics and 
Internet traffic characteristics (Adamic and Huberman 2000, Barabasi and 
Albert 2002) as well as in bibliometrics, informetrics, scientometrics, and 
library science (see (Adamic and Huberman 2002) and references therein). 
There are also suggestions for applications to other physical and biolog- 
ical, sociological and financial market processes (see list of references in 
http : / /linkage . rockefeller . edu/wli/zipf /index_ru . html). Figure [1] 
illustrates several applications of Zipf's law to different fields of social and 
natural sciences. 




Figure 1: Illustration of Zipf's law for city sizes (upper left panel, repro- 
duced from Ioannides and Gabaix (2003)), for firm sizes (upper right panel, 
reproduced from Axtell (2001)), for the number of Internet links pointing 
to some website (lower left panel, reproduced from Adamic and Huberman 
(2002)) and for the number of incoming links to packages found in differ- 
ent Linux open source software releases (lower right panel, reproduced from 
Maillart et al. (2008)). 

Among the many more or less successful explanations proposed to under- 
stand the origin of Zipf's law, one of the most promising is the explanation 
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by Gabaix (1999) and Ioannides and Gabaix (2003) formulated in the con- 
text of the distribution of city sizes, based on Gibrat's law. Gabaix (1999) 
assumed that each city exhibits a stochastic growth rate distributed inde- 
pendently from its present size. Gabaix (1999) showed that Gibrat's law for 
city growth (together with some important deviations of Gibrat's law), nor- 
malized to the whole population of a given country, leads to distributions of 
city sizes very close to Zipf's law. However, the derivation of Gabaix (1999) 
suffers from a few problems. 

First, the exact scale-independent Gibrat's law leads to a log-normal 
distribution of city sizes, which is not strictly a power law and only slowly 
converges to a power law in the limit of large log-variance (and some other 
conditions), becoming at the same time more and more degenerate. Some 
additional assumptions are therefore needed in order to produce the stable 
non-degenerate Zipf's law. In particular, Gabaix (1999) assumed that, for 
cities of small sizes, there are some exogenous factors preventing further 
decaying of their population (see also (Levy and Solomon 1996, Malcai et 
al. 1999)). More appropriate to social and economic phenomena is the sup- 
position, contrary to preventing population decay, of eliminating cities or 
firms as they reach a small size. An example is the transition from city 
to rank of village as the size goes below some threshold. In the context of 
an economy of firms, it is important to take into account the continuous 
process with births and deaths playing a central role at time scales as short 
as a few years. A goal of the present book is to demonstrate that death (as 
well as birth) processes are especially important to understand the economic 
foundation of Zipf's law and its robustness. We will consider two different 
mechanisms for the exit of a firm: (i) when the firm total asset value be- 
comes smaller than a given minimum threshold (which can vary with time 
and with countries) and (ii) when an exogenous shock occurs, modeling for 
instance operational risks, independently of the size of the firm. 

Another shortcoming of Gabaix's approach is the simplifying supposition 
that all cities originate at the same instant to, and then only grow stochasti- 
cally, obeying the balanced Gibrat's law mentioned above. We believe that 
it is more realistic, especially for the description of the behavior of the asset 
value of firms (which is more dynamic than the formation of cities), that 
the births of firms occur according to a random point process characterized 
by some mean rate v(t). Jointly, one should take into account the well- 
documented evidence that firms die, for instance when their size go under 
some low asset value level. It turns out that taking into account the random 
flow of firm births and deaths, in combination with Gibrat's law, leads to the 
pure and non-degenerate Zipf's law, without the need to the rather artificial 
modification of Zipf's law for small sizes [We note that the fact that devia- 
tion of Gibrat's law has been documented for small firms is another issue, as 
the documented deviations do not necessarily obey the assumptions needed 
in Gabaix's derivation.] As a bonus, the approach in terms of the dynamics 
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of birth-death together with stochastic growth, that we develop here, leads 
to specific predictions of the conditions under which deviations from Zipf 's 
law occur, which help rationalize the empirical evidence documented in the 
literature. The conditions involve either deviations from Gibrat's law in the 
stochastic growth process of firms or the existence of an unbalanced growth 
or decay of the mean birth rate v(t) of new firms, as we explain in details 
below. 

For transparency of derivations and for convenience of analytic calcula- 
tions, we use a continuous version of Gibrat's law, allowing us to benefit 
from the properties of the Wiener process and the mathematical framework 
of Kolmogorov's diffusion equations. We unearth new properties associ- 
ated with the stochastic behavior of firm assets. We show that the death 
of firms at some low value level as well as possibly significant deviations 
from Gibrat's law do not affect the asymptotic validity of Zipf's law in the 
limit of large firm sizes. By analyzing a large class of diffusion processes 
modeling the behavior of firm assets with growth rates very different from 
Gibrat's condition, we find general conditions for the validity of Zipf's law. 
Specifically, we have discovered stochastic growth models with non-Gibrat 
properties, leading to Zipf's and related power laws for the current density 
of firms' asset values. 

The book is organized as follows. Chapter 2 presents the continuous 
version of Gibrat's law and some peculiarities of the stochastic behavior of 
the geometric Brownian motion of firms' asset values, resulting from Gibrat's 
law. 

Chapter 3 describes the proposed model for the current density of firms' 
asset values, taking into account the random flow of the birth of firms. We 
show that, if some natural balance condition holds, which is analogous to 
Gabaix (1999) normalizing condition, while the mean birth rate of firms is 
independent of time (y = const), then the exact Zipf's law holds true. 

Amazingly, despite the relevance of Gibrat's law and the corresponding 
geometric Brownian motion in a wide range of physical, biological, sociolog- 
ical and other applications, many researchers do not make use of many of 
the interesting properties exhibited by realizations of the geometric Brow- 
nian motion, in order to derive detailed explanations of Zipf's and related 
power laws. Thus, in chapter 4, we gather little-known information con- 
cerning the statistical properties of realizations of the geometric Brownian 
motion, which play a significant role for the understanding of the roots and 
conditions of the validity of Zipf's law. 

Chapter 5 discusses in detail the influence on the validity of Zipf's law 
of the occurrence of the death of firms when their value falls below some 
low level. In chapter 6, we derive an equation for the steady-state density 
of firm asset values, which enables us to explore in detail the consequences 
of deviations from Gibrat's law at moderate asset values on the validity of 
Zipf's law at higher asset values. 
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Chapters 7 and 8 are devoted to discussing possible deviations from 
Zipf's law due respectively to the sudden death of firms and the time de- 
pendence of the birth rate. It is shown that, even in such situations, Zipf's 
law holds if some generalized balance condition is valid. In particular, we 
discuss the robustness of Zipf's law to variations of the mean birth rate and 
of the rate of growth of the mean asset value of particular firms. The second 
part of Section 8 presents a simple coupled model describing the possible 
connection between the stochastic behavior of firms' asset values and the 
mean birth rate. 

In addition to the mechanisms in terms of birth, death and random 
growth which have been considered in the previous chapters, we envision 
that the next level of development of a complete mathematical theory of 
firms needs to take into account the mechanism of mergers between firms 
(referred to as M&A for "merger and acquisition"), as well as it symmetric, 
the phenomenon of creation of spin-off firms created from parent firms which 
privatize a part of their existing business as separate units. For this, the long 
tradition in physics concerning the investigation of the processes of coagula- 
tion (merger) and of fragmentation (spin-off) could provide a fertile reservoir 
of ideas and techniques (Aldous 1999, Leyvraz 2003). Chapter 9 presents 
the integro-differential equation that expresses the coupling between firms 
introduced by M&A and spinoffs and provides preliminary results. This 
section is more an appetizer and encouragement for future works than a 
complete treatment. 
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